Indexing the Results of Sliding Window Queries ∗

نویسندگان

  • Lukasz Golab
  • Piyush Prahladka
  • M. Tamer Özsu
چکیده

A popular method of bounding the memory requirements of queries over data streamsis to use a sliding window, where old data are continuously removed as new data arrive.One problem that has not been addressed previously concerns indexing the results of slidingwindow queries. This is a noteworthy problem because data stream systems often materializecommon sub-expressions or final results of similar queries, therefore it is important to allowthe queries efficient access into a relevant subset of the materialized result. In this paper, wedesign and evaluate indexing methods that take advantage of the temporal order in whichold answers expire for efficient maintenance and querying of the results. Our techniquesallow sharing of indexed results among similar queries, adapt to fluctuating stream arrivalrates, and are experimentally shown to perform updates over twice as fast as the existingsliding window indices.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Indexing Sliding Windows over Online Data Streams

We consider indexing sliding windows in main memory over on-line data streams. Our proposed data structures and query semantics are based on a division of the sliding window into sub-windows. By classifying windowed operators according to their method of execution, we motivate the need for two types of windowed indices: those which provide a list of attribute values and their counts for answeri...

متن کامل

Framework for real-time clustering over sliding windows

Clustering queries over sliding windows require maintaining cluster memberships that change as windows slide. To address this, the Generic 2-phase Continuous Summarization framework (G2CS) utilizes a generation based window maintenance approach where windows are maintained over different time intervals. It provides algorithm independent and efficient sliding mechanisms for clustering queries wh...

متن کامل

An efficient DNA sequence searching method using position specific weighting scheme

Exact match queries, wildcard match queries, and kmismatch queries are widely used in various molecular biology applications including the searching of ESTs (Expressed Sequence Tags) and DNA transcription factors. In this paper, we suggest an efficient indexing and processing mechanism for such queries. Our indexing method places a sliding window at every possible location of a DNA sequence and...

متن کامل

ارائه روشی پویا جهت پاسخ به پرس‌وجوهای پیوسته تجمّعی اقتضایی

Data Streams are infinite, fast, time-stamp data elements which are received explosively. Generally, these elements need to be processed in an online, real-time way. So, algorithms to process data streams and answer queries on these streams are mostly one-pass. The execution of such algorithms has some challenges such as memory limitation, scheduling, and accuracy of answers. They will be more ...

متن کامل

A DNA Index Structure Using Frequency and Position Information of Genetic Alphabet

Exact match queries, wildcard match queries, and kmismatch queries are widely used in lots of molecular biology applications including the searching of ESTs (Expressed Sequence Tag) and DNA transcription factors. In this paper, we suggest an efficient indexing and processing mechanism for such queries. Our indexing method places a sliding window at every possible location of a DNA sequence and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005